Statistical Data Privacy: A Song of Privacy and Utility

نویسندگان

چکیده

To quantify trade-offs between increasing demand for open data sharing and concerns about sensitive information disclosure, statistical privacy (SDP) methodology analyzes release mechanisms that sanitize outputs based on confidential data. Two dominant frameworks exist: disclosure control (SDC) the more recent differential (DP). Despite framing differences, both SDC DP share same problems at their core. For inference problems, either we may design optimal associated estimators satisfy bounds risk measures, or adjust existing sanitized output to create new statistically valid estimators. Regardless of adjustment, in evaluating utility, inferences from mechanism require uncertainty quantification accounts effect sanitization introduces bias and/or variance. In this review, discuss foundations common DP, highlight major developments SDP, present exciting research private inference.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Privacy and Security of Big Data in THE Cloud

Big data has been arising a growing interest in both scien- tific and industrial fields for its potential value. However, before employing big data technology into massive appli- cations, a basic but also principle topic should be investigated: security and privacy. One of the biggest concerns of big data is privacy. However, the study on big data privacy is still at a very early stage. Many or...

متن کامل

Privacy and Security of Big Data in THE Cloud

Big data has been arising a growing interest in both scien- tific and industrial fields for its potential value. However, before employing big data technology into massive appli- cations, a basic but also principle topic should be investigated: security and privacy. One of the biggest concerns of big data is privacy. However, the study on big data privacy is still at a very early stage. Many or...

متن کامل

An Axiomatic View of Statistical Privacy and Utility

“Privacy” and “utility” are words that frequently appear in the literature on statistical privacy. But what do these words really mean? In recent years, many problems with intuitive notions of privacy and utility have been uncovered. Thus more formal notions of privacy and utility, which are amenable to mathematical analysis, are needed. In this paper we present our initial work on an axiomatiz...

متن کامل

Privacy vs. Utility in Anonymized Data

We investigate the privacy and utility aspects of k-anonymity, which has received much research attention since its introduction in [Sweeney, 2002]. Meyerson and Williams [2004] showed that finding an optimal k-anonymization is NP-hard and developed a first approximation algorithm. Further algorithms with different approximation guarantees have been proposed, but it remains hard to compare thes...

متن کامل

A Survey of Utility-based Privacy-Preserving Data Transformation Methods

As a serious concern in data publishing and analysis, privacy preserving data processing has received a lot of attention. Privacy preservation often leads to information loss. Consequently, we want to minimize utility loss as long as the privacy is preserved. In this chapter, we survey the utility-based privacy preservation methods systematically. We first briefly discuss the privacy models and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Annual review of statistics and its application

سال: 2023

ISSN: ['2326-8298', '2326-831X']

DOI: https://doi.org/10.1146/annurev-statistics-033121-112921